Classification of Imbalanced Data Represented as Binary Features

Abstract

Typically, classification is conducted on a dataset that consists of numerical features and target classes. For instance, a grayscale image, which is usually represented as a matrix of integers varying from 0 to 255, enables one to apply various classification algorithms to image classification tasks. However, datasets represented as binary features cannot use many standard machine learning algorithms optimally, yet their amount is not negligible. On the other hand, oversampling algorithms such as the synthetic minority oversampling technique (SMOTE) and its variants are often used if the dataset for classification is imbalanced. However, since SMOTE and its variants synthesize new minority samples based on the original samples, the diversity of the samples synthesized from binary features is highly limited due to the poor representation of the original features. To solve this problem, a preprocessing approach is studied. By converting binary features into numerical ones using feature extraction methods, succeeding oversampling methods can fully display their potential in improving the classifiers' performances. Through comprehensive experiments using benchmark datasets and real medical datasets, it was observed that a converted dataset consisting of numerical features is better for oversampling methods (maximum improvements of accuracy and F1-score were 35.11% and 42.17%, respectively). In addition, it is confirmed that feature extraction and oversampling synergistically contribute to the improvement of classification performance.
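The limitation the abstract attributes to oversampling on binary features can be illustrated with a small sketch. The code below is not the authors' implementation; it is a minimal SMOTE-style interpolation written with the standard library only, and the function names (`hamming`, `smote_like`) and the toy minority samples are assumptions made for illustration. It shows one aspect of the problem: any feature on which all minority samples agree is reproduced unchanged in every synthetic sample, so new points can only vary along the few bits where a sample and its neighbour differ.

```python
import random

def hamming(a, b):
    """Number of positions where two binary vectors differ."""
    return sum(x != y for x, y in zip(a, b))

def smote_like(minority, n_new, k=2, seed=0):
    """Synthesize n_new points by interpolating between a minority sample
    and one of its k nearest minority neighbours (SMOTE-style sketch)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` among the other minority samples
        others = [m for m in minority if m is not base]
        neighbours = sorted(others, key=lambda m: hamming(base, m))[:k]
        nn = rng.choice(neighbours)
        lam = rng.random()  # interpolation weight in [0, 1)
        synthetic.append([b + lam * (n - b) for b, n in zip(base, nn)])
    return synthetic

# Toy minority class: four binary samples (hypothetical data).
minority = [
    [1, 0, 1, 1, 0],
    [1, 0, 1, 0, 0],
    [1, 0, 0, 1, 0],
    [1, 0, 1, 1, 1],
]
new_points = smote_like(minority, n_new=20)

# Columns that are constant across the minority class; interpolation
# cannot move a synthetic sample away from these values.
constant_cols = [j for j in range(5) if len({m[j] for m in minority}) == 1]
```

In this toy data the first two features are constant across the minority class, so all 20 synthetic points share them exactly. Converting the binary features to numerical ones before oversampling, as the abstract proposes, is intended to give the interpolation a richer space to work in.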

Similar resources

On Mining Fuzzy Classification Rules for Imbalanced Data

A fuzzy rule-based classification system (FRBCS) is a popular machine learning technique for classification purposes. One of the major issues when applying it to imbalanced data sets is its bias toward the majority class, such that it performs poorly with respect to the minority class. However, in many cases the minority classes are more important than the majority ones. In this paper, we have extended ...

Mine Classification with Imbalanced Data

In binary classification problems it is common for the two classes to be imbalanced: one case is very rare compared to the other. Traditional classification approaches usually ignore this class imbalance, causing performance to suffer accordingly. In contrast, the infinitely imbalanced logistic regression (IILR) algorithm explicitly addresses class imbalance in its formulation. This p...

Extract minimum positive and maximum negative features for imbalanced binary classification

In an imbalanced dataset, the positive and negative classes can be quite different in both size and distribution. This degrades the performance of many feature extraction methods and classifiers. This paper proposes a method for extracting minimum positive and maximum negative features (in terms of absolute value) for imbalanced binary classification. This paper develops two models to yield the...

Journal

Journal title: Applied Sciences

Year: 2021

ISSN: 2076-3417

DOI: https://doi.org/10.3390/app11177825